Get plot data for prepostfit experiments #438


Merged: 38 commits into pymc-labs:main on Apr 17, 2025

Conversation

@lpoug (Contributor) commented Feb 7, 2025


Note: this is my first time doing a proper PR, so I'm not sure whether all the prerequisites are here.


📚 Documentation preview 📚: https://causalpy--438.org.readthedocs.build/en/438/

@lpoug lpoug marked this pull request as ready for review February 17, 2025 08:44
@drbenvincent (Collaborator)

Humble apologies for taking so long to get to this PR @lpoug. I've unfortunately not had as much time to spend on CausalPy as I'd have liked, but I'm hoping to catch up with the backlog.

There are currently a couple of issues with the remote checks. I'm hoping to get these resolved in #437, at which point I'll test this out locally and give feedback if necessary before we can merge this :)

codecov bot commented Feb 27, 2025

Codecov Report

Attention: Patch coverage is 96.00000% with 4 lines in your changes missing coverage. Please review.

Project coverage is 94.53%. Comparing base (2a6f9db) to head (da6c91d).
Report is 39 commits behind head on main.

Files with missing lines             Patch %   Lines
causalpy/experiments/base.py         84.21%    3 Missing ⚠️
causalpy/experiments/prepostfit.py   97.05%    1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #438      +/-   ##
==========================================
+ Coverage   94.40%   94.53%   +0.12%     
==========================================
  Files          31       31              
  Lines        1985     2068      +83     
==========================================
+ Hits         1874     1955      +81     
- Misses        111      113       +2     

☔ View full report in Codecov by Sentry.

@lpoug (Contributor, Author) commented Mar 3, 2025

> Humble apologies for taking so long to get to this PR @lpoug. I've unfortunately not had as much time to spend on CausalPy as I'd have liked, but I'm hoping to catch up with the backlog.
>
> There are currently a couple of issues with the remote checks. I'm hoping to get these resolved in #437, at which point I'll test this out locally and give feedback if necessary before we can merge this :)

Absolutely no problem whatsoever @drbenvincent! Let me know when the time comes, I'll be around 😄

@drbenvincent drbenvincent added the enhancement New feature or request label Mar 3, 2025
@drbenvincent (Collaborator)

Hi @lpoug. I've pushed some changes; can you make sure to pull the latest version?

I'll try to review this in the next few days :)

@lpoug (Contributor, Author) commented Apr 1, 2025

Hey there @drbenvincent. Just to be sure, are you waiting on anything on my side?

@drbenvincent (Collaborator)

Apologies for the delay! Just dropping in some review comments now.

@drbenvincent (Collaborator) left a comment

Sorry about the slow review on this. My bad.

Overall this looks good. I've suggested some minor changes. Other than that, the main thing is to update the tests to ensure this functionality remains working into the future.

Could you add new tests to test_integration_pymc_examples.py and test_integration_skl_examples.py? I imagine we can just test that we successfully get back a dataframe from calling result.get_plot_data on the experiments that you've implemented so far. You could optionally test that the contents of that dataframe are as expected, e.g. that it has the desired columns.

In theory, an ultra-pedantic person might want to test that we get an exception when calling get_plot_data on experiments that don't implement it.
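
For illustration, a minimal sketch of what both tests might look like. This is hedged: `result` and `unsupported_result` are hypothetical fixtures standing in for fitted experiment objects, and the column names and exception type are assumptions rather than CausalPy's confirmed API.

```python
# A sketch only: the fixtures, column names, and exception type below are
# assumptions for illustration, not CausalPy's actual test code.
import pandas as pd
import pytest


def test_get_plot_data_returns_dataframe(result):
    """get_plot_data should return a DataFrame for supported experiments."""
    plot_data = result.get_plot_data()
    assert isinstance(plot_data, pd.DataFrame)
    # Optionally check the contents, e.g. that the expected columns exist
    assert {"prediction", "pred_hdi_lower", "pred_hdi_upper"}.issubset(
        plot_data.columns
    )


def test_get_plot_data_raises_when_unsupported(unsupported_result):
    """Experiments without plot-data support should raise an exception."""
    with pytest.raises(NotImplementedError):
        unsupported_result.get_plot_data()
```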

Because this PR involves additional methods, can you run make uml? This should update the UML diagram that we include in CONTRIBUTING.md.

Sorry again about the latency on this review.

@lpoug (Contributor, Author) commented Apr 7, 2025

> Sorry about the slow review on this. My bad.
>
> Overall this looks good. I've suggested some minor changes. Other than that, the main thing is to update the tests to ensure this functionality remains working into the future.
>
> Could you add new tests to test_integration_pymc_examples.py and test_integration_skl_examples.py? I imagine we can just test that we successfully get back a dataframe from calling result.get_plot_data on the experiments that you've implemented so far. You could optionally test that the contents of that dataframe are as expected, e.g. that it has the desired columns.
>
> In theory, an ultra-pedantic person might want to test that we get an exception when calling get_plot_data on experiments that don't implement it.
>
> Because this PR involves additional methods, can you run make uml? This should update the UML diagram that we include in CONTRIBUTING.md.
>
> Sorry again about the latency on this review.

No problem at all! I was just starting to worry that something still needed to be done on my end 😅

Thank you for the reviews. I've added links to the relevant commits directly in your comments: 6a6face (renaming of functions).

Regarding tests, I have added them in 97f0d79

I have not added anything yet to test that we get an exception when calling the get_plot_data on experiments for which it is not implemented. I'll try to take a moment to think about how to do so precisely.

Finally, I have updated the diagrams in 0edca77

Let me know if these changes look good, or if you had anything else in mind!

@drbenvincent (Collaborator) left a comment

Looks good. I think we are very nearly there :) Thanks for adding in the tests.

It could be prudent to rename the *_hdi_lower and *_hdi_upper columns to include the numerical hdi_prob as a percentage. For example, if hdi_prob=0.8 then the columns could be labelled *_hdi_lower_80 and *_hdi_upper_80. That way there is much less scope for mistakes like generating 80% HDIs but then thinking you generated 95% HDIs. I think that should be pretty simple to do in _get_plot_data_bayesian or _get_plot_data_ols.
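
Something along these lines might do it (a minimal sketch; the `hdi_column_names` helper and the `pred` prefix are illustrative assumptions, not existing CausalPy code):

```python
# Sketch of embedding hdi_prob in the column labels; `hdi_column_names`
# is a hypothetical helper, not an existing CausalPy function.
def hdi_column_names(prefix: str, hdi_prob: float) -> tuple[str, str]:
    """Build HDI column labels that embed the probability as a percentage."""
    pct = round(hdi_prob * 100)  # e.g. 0.8 -> 80
    return f"{prefix}_hdi_lower_{pct}", f"{prefix}_hdi_upper_{pct}"


# With hdi_prob=0.8 this yields ("pred_hdi_lower_80", "pred_hdi_upper_80"),
# so an 80% interval can never masquerade as a 95% one.
lower_col, upper_col = hdi_column_names("pred", hdi_prob=0.8)
```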

It should also be pretty simple to resolve the merge conflicts; it's just the updated UML images as far as I can see.

Could you also update the docstrings of the tests? At the moment they list out what is tested, so you can just flag that they test the functionality of plot_data. I'm not sure we'll carry on doing that in the future if the number of tests gets large, but let's keep it up for the moment.

…ted tests accordingly, and updated tests' docstring
@lpoug (Contributor, Author) commented Apr 8, 2025

I made the changes for dynamic naming of *_hdi_upper and *_hdi_lower and adjusted the tests accordingly (please see my comment above regarding tests). I also updated the docstrings of the tests as requested.
See commit 44d3870

Regarding the merge conflicts, to be honest I'm not sure what I need to do on my side. Could you enlighten me, please?

Thanks!

@drbenvincent (Collaborator)

Hi @lpoug. I've sorted the conflicting files and done a few small things. I'm just noticing that at the moment the rendered docs obscure the allowable args/kwargs. So we have this:

[Screenshot 2025-04-16 at 10 08 06]

This doesn't give the user much hint of what they can pass.

Same situation if we look at an actual experiment class:
[Screenshot 2025-04-16 at 10 08 51]

I will have a quick play with this to see if we can expose the parameter information to the user in the docs. I think the easiest way is to revert a previous suggestion and make _get_plot_data_bayesian and _get_plot_data_ols public methods again. I'll experiment and make a commit shortly.
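
Roughly the shape of that change, as described above (a sketch under assumptions; the real class layout and signatures in causalpy/experiments/base.py may well differ):

```python
# Illustrative only: a base class whose public get_plot_data dispatches to
# public Bayesian/OLS variants so that Sphinx can document their kwargs.
import pandas as pd


class BaseExperiment:
    supports_bayes: bool = True  # assumed flag; the real check may differ

    def get_plot_data(self, *args, **kwargs) -> pd.DataFrame:
        """Dispatch to get_plot_data_bayesian or get_plot_data_ols.

        Making the two variants public means their parameters (e.g.
        hdi_prob in the Bayesian case) show up in the rendered docs.
        """
        if self.supports_bayes:
            return self.get_plot_data_bayesian(*args, **kwargs)
        return self.get_plot_data_ols(*args, **kwargs)

    def get_plot_data_bayesian(self, hdi_prob: float = 0.94) -> pd.DataFrame:
        raise NotImplementedError("Not implemented for this experiment.")

    def get_plot_data_ols(self) -> pd.DataFrame:
        raise NotImplementedError("Not implemented for this experiment.")
```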

@drbenvincent (Collaborator)

So now the docs expose the kwargs to the user. It's not ultra clean because we wanted to hide the bayesian vs ols functions, but at least this way the user can find out what (if any) kwargs they can pass. Here are some examples:

[Screenshot 2025-04-16 at 10 28 10]

Those functions are clickable, which takes you to this, for example:

[Screenshot 2025-04-16 at 10 28 25]

And the arviz.hdi link is also clickable and takes you through to the external docs.

@drbenvincent (Collaborator) commented Apr 16, 2025

Any other final changes @lpoug, or are you happy to merge this now?

Looks like there might just be a quick test coverage issue to deal with, but I'm sure we can get that done between us.

@drbenvincent drbenvincent self-requested a review April 17, 2025 08:09
@drbenvincent (Collaborator)

Success with code coverage and all tests passing. Hope you don't mind that I carried this over the finish line @lpoug. Sorry the process was a little slow; it will be faster on your next PR 😀

@drbenvincent drbenvincent merged commit 7e0ca34 into pymc-labs:main Apr 17, 2025
8 checks passed
@lpoug (Contributor, Author) commented Apr 18, 2025

Amazing, thank you @drbenvincent! Glad to have been a part of this 😄 Looking forward to the next, even more challenging one!

Labels: enhancement (New feature or request)
Closes: Get model results data
2 participants